Probabilistic framework for solving visual dialog

نویسندگان

چکیده

In this paper, we propose a probabilistic framework for solving the task of ‘Visual Dialog’. Solving requires reasoning and understanding visual modality, language common sense knowledge to answer. Various architectures have been proposed solve by variants multi-modal deep learning techniques that combine representations. However, believe it is crucial understand analyze sources uncertainty task. Our approach allows estimating also aids diverse generation answers. The obtained through representation module provides us with representations image, question conversation history, ensures latent candidate answers are given an chooses appropriate answer minimizes uncertainty. We thoroughly evaluate model detailed ablation analysis, comparison state art visualization in method. Using framework, thus obtain improved dialog system more explainable.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Frame-Based Probabilistic Framework for Spoken Dialog Management Using Dialog Examples

This paper proposes a probabilistic framework for spoken dialog management using dialog examples. To overcome the complexity problems of the classic partially observable Markov decision processes (POMDPs) based dialog manager, we use a frame-based belief state representation that reduces the complexity of belief update. We also used dialog examples to maintain a reasonable number of system acti...

متن کامل

Probabilistic Dialog Management

Modeling user interfaces as dialogs provides a conceptual framework to address global coherence and efficiency of interactions. While non-probabilistic approaches provide convincing results and transparent dialog behavior, probabilistic techniques can help to account for inherent uncertainties in user input. In this paper, we present three patterns for probabilistic dialog management or support...

متن کامل

WizArg: Visual Argumentation Framework Solving Wizard

Extension-based argumentation semantics have shown to be a suitable approach for performing practical reasoning. An important concern in extensionbased-argumentation semantics is the computational complexity of the decision problems that has been shown to range from NP-complete to Π 2 -complete. In this paper, we introduce a generic extension-based argumentation semantics solver, that is called...

متن کامل

Interactive Visual Dialog

In this paper we propose a paradigm called the Interactive Visual Dialog (IVD) as a means of facilitating a system’s ability to recognize objects presented to it by a human. The presentation centers around a supermarket checkout scenario in which an operator presents an item to be tallied to a stationary television camera. An active vision approach is used to provide feedback to the operator in...

متن کامل

CoDraw: Visual Dialog for Collaborative Drawing

In this work, we propose a goal-driven collaborative task that contains vision, language, and action in a virtual environment as its core components. Specifically, we develop a collaborative ‘Image Drawing’ game between two agents, called CoDraw. Our game is grounded in a virtual world that contains movable clip art objects. Two players, Teller and Drawer, are involved. The Teller sees an abstr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2021

ISSN: ['1873-5142', '0031-3203']

DOI: https://doi.org/10.1016/j.patcog.2020.107586